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q ! Abstract 

Channel state feedback schemes for the MIMO broadcast downlink have been widely studied in 
-^ the frequency-fiat case. This work focuses on the more relevant frequency selective case, where some 

important new aspects emerge. We consider a MIMO-OFDM broadcast channel and compare achievable 
ergodic rates under three channel state feedback schemes: analog feedback, direction quantized feedback 
and "time-domain" channel quantized feedback. The first two schemes are direct extensions of previously 
proposed schemes. The third scheme is novel, and it is directly inspired by rate-distortion theory of 
Gaussian correlated sources. For each scheme we derive the conditions under which the system achieves 
full multiplexing gain. The key difference with respect to the widely treated frequency-flat case is that 
in MIMO-OFDM the frequency-domain channel transfer function is a Gaussian correlated source. The 
new time-domain quantization scheme takes advantage of the channel frequency correlation structure 
and outperforms the other schemes. Furthermore, it is by far simpler to implement than complicated 
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quantization is actually needed for efficient channel state information feedback. 



spherical vector quantization. In particular, we observe that no structured codebook design and vector 
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I. Introduction 

We consider a MIMO-OFDM broadcast channel with one base station (BS), equipped with 
M antennas, and K > M single-antenna user terminals (UT). MIMO broadcast channels have 
been widely studied in the recent past (see for example [1], [2], [3], [4], [5]). Under perfect 
transmitter channel state information (CSIT) at the BS and receiver channel state information 
(CSIR) at the UTs, its capacity was fully characterized in [5] and efficient resource allocation 
algorithms have been proposed in order to operate at desired points in the capacity region (e.g., 
[6], [7], [8]). In the current standardization of the 4-th Generation of wireless communication 
systems (e.g., IEEE802.16m), MEMO broadcast schemes are going to play a fundamental role 
in order to achieve high data rates in the downlink. In practice, CSIT must be provided to the 
BS by some form of feedback. 

CSIT feedback schemes are a very active area of research (see for example [9] and the special 
issue [10] for a fairly complete list of references). In brief, we may identify three broad families: 
1) open-loop schemes based on channel reciprocity and uplink training symbols, applicable to 
Time-Division Duplexing (TDD); 2) closed-loop schemes based on feeding back the unquantized 
channel coefficients (analog feedback); 3) closed-loop schemes based on explicit quantization 
of the channel vectors and on feeding back quantization bits, suitably channel-encoded (digital 
feedback). Closed-loop schemes are suitable for Frequency-Division Duplexing (FDD), where 
channel reciprocity cannot be exploited. Most if not all present works deal with the case of 
a frequency-flat channel. In particular, it was recognized that the most important information 
about the channel vectors consists of their directions. Directional quantization is obtained by 
using vector quantization codebooks formed by unit vectors distributed on the M dimensional 
complex sphere. In [11], ergodic achievable rates are analyzed assuming linear zero-forcing 
beamforming (ZFBF) and random ensembles of spherical quantization codebooks, uniformly 
distributed on the unit sphere. These results have been extended in [9] to a variety of cases 
including realistic feedback channels with noise, fading and delay, and to non-perfect CSIR at 
the UTs obtained by explicit downlink training. In particular, these works show that the sum-rate 
scales optimally, as M logSNR + 0(1), provided that the number of quantization bits per UT 
increases with SNR as B = a(M — 1) log 2 SNR for some a > 1. For example, at SNR of 10 dB 
a codebook of size 1024 is needed for M = 4 antennas, and a codebook of size 2 24 = 16777216 



is needed for M = 8 antennas. Clearly, such channel vector quantizers involve an enormous 
computational complexity unless some special structure is exploited. Structured spherical vector 
quantizers for direction quantization have been studied, for example, in [12]. 

The frequency- selective (OFDM) case is more directly relevant to 4-th Generation wireless 
systems. A trivial solution consists of operating one independent CSIT feedback per carrier. This 
solution is suboptimal since it does not take advantage of the fact that the channel vectors at 
different carriers are correlated. In this paper we compare three channel state feedback schemes 
for the MIMO-OFDM downlink: analog feedback, digital direction quantized feedback and a new 
"time-domain" channel quantized feedback inspired by rate-distortion theory. For each scheme 
we derive the conditions under which the system achieves full multiplexing gain (i.e., the pre- 
log factor of the sum-rate is equal to M). The new rate-distortion inspired scheme takes full 
advantage of the channel frequency correlation structure and it is shown to outperform the first 
two. Furthermore, time-domain quantization is by far simpler to implement than complicated 
spherical vector quantization. In particular, it is seen that no structured codebook design for 
vector quantization is actually needed for efficient channel state information feedback. 

II. System Model 

For the sake of analytical simplicity, we do not consider users selection based on CSIT 
feedback information (e.g., as in [13], [14], [15]). Therefore, without loss of generality, we may 
assume that a set of M out of K users is selected at each time slot according to some scheduling 
scheme independent of the channel realizations. Also, we assume perfect CSIR at all UTs and 
focus solely on the CSIT feedback performance. Channels are identically distributed for all users, 
and spatially independent (no antenna correlation). Therefore, we focus on the description of 
the scalar channel between any BS antenna and a generic user, dropping antenna and user index 
for the sake of notation simplicity. A standard assumption in OFDM is that channels behave 
locally as linear time-invariant finite impulse response filters of length L. We assume block- 
fading channels, constant on blocks of duration T ^> L symbols, and changing according to 
some ergodic statistics from block to block. In this work we consider zero-delay CSIT feedback 
and block-by-block estimation. Therefore, we are not concerned with the time-correlation from 
block to block of the channel (the case of delayed feedback and explicit channel prediction is 
considered in [9]). Using the standard cyclic-prefix method, blocks of N = T — L+l information 



symbols can be transmitted without inter-block interference at the cost of a small dimensionality 
loss factor of (1 — ^-) ~ 1, that shall be neglected in the achievable rate expressions of this 
paper since it applies to all such OFDM schemes in the same way. 

After cyclic prefix insertion and removal the resulting channel model is defined by a block 
transmission of N symbols per transmit antenna, over the iV OFDM subcarriers. Letting h = 
[h[0], h[l], ..., h[L — 1]] T denote the discrete-time channel impulse response, the channel in the 

DFT frequency domain is given by H = [H[0], . . . , H[N - 1]] T , where H = v^F 



h 




and where F denotes a unitary N x N DFT matrix with elements [F]„£ = -J= e -J 27rfa /^ w ^ 
n = 0, ... , N — l, £ = 0, . . . , N—l. A common assumption consists of modeling the time-domain 
channel coefficients h[l]'s as independent Gaussian random variables ~ CN(0, of), where the 
path variances {oq, . . . , o\_^\ forms the Delay Intensity Profile (DIP) of the channel. We follow 
this model here, and re-discuss it in Section |Vl] where we show how to take advantage of a more 
physically motivated channel model. The frequency-domain channel covariance matrix is given 
by 
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where Yl h = diag(oQ, ...,a|_ 1 ). Furthermore, the diagonal elements of £# are equal to a 2 H 
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In the MIMO case, the channel from the BS to UT k is defined by the vector discrete-time 
impulse response [hfc[0], h fe [l], . . . , h k [L — 1]} where h kji [l] is the channel coefficient from the BS 
antenna i to the UT k at discrete-time delay I. By applying OFDM modulation and demodulation, 
the received signal at UT k on the n-th subcarrier can be written as 
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H%[n]x{n] + z k [n] (2) 



where k = 1, ..., K, n = 0, ..., N — 1, x[n] G C M is the transmitted vector of frequency-domain 
symbols on the M BS antennas, at subcarrier n, and H fc [n] = [H k: i[n], ..., H k>M [n]] T is the 
channel vector of UT k at subcarrier n. The average transmit power constraint is given by 

jjY.n=*nAn]\ 2 ]<y. 

For simplicity of analysis, this paper treats only the case of linear Zero-Forcing Beamforming 
(ZFBF). It is well-known that ZFBF performs at a fixed gap from the optimal capacity achieving 



strategy under perfect CSIT. Hence, our goal is to find conditions under which ZFBF performs 
at a fixed rate gap from the perfect CSIT case, which implies fixed rate gap from optimal. For 
perfect CSIT, the ZFBF transmitted signal at subcarrier n is given by x[n] = V[n]u[n] where 
V[n] G C MxK is a zero-forcing beamforming matrix with unit norm columns such that each 
k-th column v fc [n] is orthogonal to the subspace spanned by {H.j[n] : j ^ k}, and u[n] E C K 
denotes the vector of coded symbols, independently generated for the different UTs. In high 
SNR the uniform power allocation yields a fixed rate gap from the optimal (waterfilling) power 
allocation. Therefore, following [11] and [9], we restrict to this case and let E[u[n]u[n] H ] = -pi. 
Under these assumptions, the achievable rate at each UT k under ZFBF with perfect CSIT is 
given by 
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where Ei(n,x) = f™ ^-dt, x > 0, is the exponential-integral function. 

In the case of non-ideal CSIT, the BS uses the available channel information H fc [n], k = 
1, . . . , K, n — 0, . . . , N — 1, and computes the ZFBF matrix V[n] by treating H^fn] as if it was 
the true channel. The resulting received signal at the A;-th UT is 

y k [n] = H^ 1 [n] v fc [n] u k [n] + ^ H^ 1 [n] Vj [n] Uj [n] + z k [n] 

= a k)k [n]u k [n] + ^ a kJ Uj [n] + z k [n] (4) 

where a k j[n] denotes the coupling coefficient between the user channel h k [n] and the beam- 
forming vector Vj[n]. By following in the footsteps of the achievable rate bound in [9, Theorem 
2] we obtain that the achievable ergodic rate for user k is lowerbounded by R k > R kiC siT — AR k , 
where the rate-gap is upperbounded by 

with I k [n] = '^2j^k a k,j u A n ] indicating the multiuser interference term. An upper bound on the 
rate R k achievable with Gaussian random coding is also obtained in [9, Theorem 3] by assuming 
that a genie provides each UT k with exact knowledge of the signal and interference coefficients 
a k j for j = 1, . . . , M. This upperbound is referred to as the "genie-aided upperbound" and takes 



on the form 
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By dividing both lower and upper bound to the achievable rate by log(T/iVo) and letting 7 /No — ► 
oo, it is clear that a sufficient condition for achieving full multiplexing gain is that AR k is a 
bounded function of the SNR T/iVolj We shall examine this condition under different CSIT 
feedback schemes in the following sections. 

III. Analog Feedback 

Analog feedback consists of sending back the unquantized channel coefficients, transmitted as 
real and imaginary parts of a complex modulation symbol [16]. We model the feedback channel 
as AWGN, with the same SNR of the downlink, equal to 7/N . The more involved case of a 
fading MEMO multiple-access (uplink) feedback channel is treated, for the frequency-flat case, 
in [9], [16]. 

In order to take advantage of the channel frequency correlation, we partition the N subcarriers 
into J clusters such that N' = N/J is an integer, and feed back only the channel measured 
at frequencies n' = iN' for % = 0, 1, ..., J — 1. Each UT transmits its channel coefficients at 
frequency n' by using M' > M feedback channel uses per channel coefficient, for a total of M'J 
channel uses. This is achieved by modulating the channel vector H[n'] by a M' x M unitary 
spreading matrix [9], [16]. After despreading, the noisy analog feedback observation for UT k 
at frequency n' = iN' is given by 



g fc [i] = VPyHk [iN f ] + w k [iN'} (7) 

where (3 = M'/M > 1 and where w k [n'] E C Mxl is the AWGN in the feedback channel, with 
i.i.d. components ~ CN(0, No)). The BS performs linear MMSE "interpolation" based on the 
observation © for i = 0, . . . , J— 1 and compute the beamforming V[n] for each subcarrier based 
on the estimated channel. Since channels are spatially i.i.d., the BS can estimate independently 
each antenna for each UT. Therefore, without loss of generality, we focus on the side information 

'This condition is actually stronger, since it requires constant rate gap from optimal. Strictly speaking, full multiplexing gain is 
achieved if ARt is o(log(T/iVo)). However, in the cases analyzed in this work either AR^ is bounded, or it is 0(log(CP/iVo)), 
therefore this option is irrelevant in this context. 



and estimation of antenna m of UT k. By stacking the feedback observations, we form the vector 
gfe.m = [0*,m[O], • • • , 9k,m[J - 1]] T given by 



gk, m = V /3TSH fcjm + w fc , m (8) 

where Hfc >rn = [iffc )m [0],.fffc jm [l], ..., Hk t m[N — 1]] T , Wfc >m contains the AWGN samples and S is 
a J x N sampling matrix defined by [S], >n = 5 n=i N', for i — 0, . . . , J — 1 and n = 0, . . . , N — 1. 
By letting p = (37 /N , the MMSE estimator of H fe>m from g fcm is given by 

H fc , m = A /^SHS H (pSSHS H H-l) _1 g fc , ro (9) 
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where £# is defined by £Q). The corresponding MMSE covariance matrix is given by 

S e = £H-pEtfS H (l + pS£HS H ) -1 S5]tf (10) 

Our main result on analog feedback is summarized by the following: 

Theorem 1: The achievable rate gap of MIMO-OFDM ZFBF with analog CSIT feedback as 
described above is upperbounded by 
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where {om : / = 0,...,L — 1} are the DIP components arranged in decreasing order, 2 = 
min{ J, L} and {A(j) : % = 0, . . . , z} are the non-zero eigenvalues of the matrix QS/,a H arranged 
in increasing order, where ex. is the leftmost J x L block of the matrix SF. 

Proof: See Appendix H ■ 

In particular, if z = min{J, L} = L, as 7/N — ► 00 the rate gap is upper bounded by the 
constant 
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A fair comparison of digital and analog CSIT feedback schemes is provided by the achievable 
rate gap versus the number of CSIT feedback channel uses. For example, the above analog 
feedback scheme makes use of M'J feedback channel uses. We generally express our results in 
terms of the normalized number of feedback channel uses per antenna, i.e., through the coefficient 
ctfb > 1 such that a^M is the total number of feedback channel uses per user per frame. 



IV. Directional Vector Quantization 

We consider directional quantization based on random vector quantization (RVQ) codebook en- 
sembles, as in [11]. Each UT has a randomly generated quantization codebook C = {ci, ..., c 2 s} 
consisting of 2 B codewords independently and isotropically distributed on the M -dimensional 
unit complex sphere. In order to reduce the number of feedback bits, several current system 
proposals consider to cluster the subcarriers and feedback the quantized channel only for one 
representative frequency for each cluster, as done for the analog feedback scheme considered in 
Section Hn] (see for example [17] in the single-user MIMO-OFDM case). Since it is not clear 
how to interpolate the direction information over the subcarriers, a common approach consists 
of assuming that the channel is constant over clusters spanning less that the channel coherence 
bandwidth, and use a piece-wise constant beamforming matrix, computed from the CSIT at the 
center subcarrier in each cluster. We analyze this "piecewise constant" approach in terms of 
achievable rate gap. We consider again a grid of J equally spaced frequencies as before. On 
each such frequency n', the quantization of the channel vector H^n/] obeys the rule 

H fc \n'\ = arg max ' kl J [ 9 J| (13) 

k[ cee \ Hk [n'}\ 2 

The binary indices corresponding the selected quantization codewords {H fc [n'] : n' = iN',i = 
0, . . . , J — 1} are fed back to the BS over a perfect (error- free, delay free) digital feedback link, 
for a total of B tot feedback bits per UT. The total number of feedback bits per UT per frame is 
given by B tot = BJ. 

Using the MMSE decomposition, the channel vector at a subcarrier n ^ n' in the same cluster 
of n' can be written as 

H k [n} = il k [n}+e k [n,n'} (14) 

where iL k [n] = c[n,n']H k [n'] and where we define the channel correlation coefficient between 
subcarriers n and n! as 

E[g fc|m [n]g fc , m [nT] Et'o 1 ° 2 i e^n~n>),N 

The corresponding MMSE is given by of[n, n'\ = crjj(l — |c[n,n']| 2 ). The ZFBF matrix V[n'] 
computed from the quantized channels Hi[n'], . . . , Hy [n 7 ] is used for all subcarriers n in the 
cluster of adjacent frequency indices {n' — a, . . . , n'+b}, taken modulo N because of the circulant 



statistics of the frequency-domain channels, where a = N'/2 — 1,6 = N'/2 if N' is even and 
a = b = [N'/2\ if N' is odd. Our main result with this form of quantized feedback is given by 
the following: 

Theorem 2: The achievable rate gap of MEVIO-OFDM ZFBF with digital channel state feed- 
back based on RVQ as described above is upperbounded by 
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where a, 6 have been defined above and where we define 
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Proof: See Appendix HB 
In order to express the total number of feedback bits B tot in terms of feedback channel uses, 
we make the optimistic assumption that the feedback link can operate error-free at capacity within 
the strict one-frame delay constraint. This assumption is justified in light of the achievability 
results of [9], where it is shown that a rate gap very close to this case can be achieved by using 
very simple practical codes and taking into account the feedback error probability. It follows 
that Btot bits can be transmitted in a^{M — 1) channel uses, g where a^, 
Expressing the rate gap in terms of a^, we obtain 
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We observe that the rate gap grows linearly with log(T/A^ ) unless we let J = N. Hence, 
providing only one direction quantized channel per subcarrier cluster does not take advantage of 
the channel frequency correlation in an efficient way, since the channel is not exactly piecewise 
constant in frequency. Eventually, for sufficiently large SNR, the channel frequency variations 
are such that the residual interference will dominate on all frequencies n ^ n' . Letting J = N 
and using B = B tot /N bits per carrier yields 
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2 For simplicity, we normalize here by M — 1 instead of M. This is justified by the fact that directional quantization does 
not include any information on the channel magnitude. Furthermore, our numerical results show that this slight bias against 
directional quantization does not yield any significant difference in the performance comparisons. 



which is bounded (or even vanishing with increasing SNR) as long as apo/N > 1. However, 
this choice may not be optimal for a given SNR. In practice, for given «fb and SNR, the system 
performance can be optimized by choosing the number of clusters J. The optimization of J is 
carried out numerically and generally depends on the operating SNR and on the channel DIR 
that determines the correlation coefficient c(S). 



V. Time-Domain Quantization 

The frequency-domain channel vector H fcm for a given BS antenna m and UT k can be 
regarded as a correlated Gaussian source with covariance matrix T, H . Letting H fcm = \/NFh k>m , 
where h fcm is the time-domain channel impulse response for UT k and BS antenna m, and 
noticing that F is an isometry, it follows that 
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where we let H fcm = y NFhk t m- It follows that the mean-square distortion for Hk, m is minimized 
by minimizing the mean-square distortion for h fc m . 



A. Rate -Distortion Limit 

Since the components of h.k, m are independent, we are in the presence of a set of L "parallel" 
Gaussian sources. The rate-distortion function for parallel Gaussian sources and mean-square 
distortion is given by [18] 
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where 7 is the solution of X^=o mm {7) °f} — D. The number of bits per symbol allocated 
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. Notice that if 7 > af, then 



to the quantization of the Z-th path is given by B { 

B[ = 0. This corresponds to the appealing and intuitive fact that more quantization bits should 

be allocated to the dominant paths. The bit allocation is usually referred to as reverse waterfilling 

(RWF). 

Under the (optimistic) assumption that the CSIT feedback can operate at the rate-distortion 
limit, our main result with this form of quantized feedback is given by the following: 
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Theorem 3: The achievable rate gap of MEMO-OFDM ZFBF with digital channel state feed- 
back based on time-domain quantization described above is given by 
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and the number of quantization bits per UT given by B 



ifc,i| 
MR(D) and R(D) is given in Q9J. 

Proof: See Appendix Hill ■ 

The superscript "KL" indicate the fact that the above approach corresponds to quantizing the 
Karhunen-Loeve transformed channel which corresponds to quantizing the time-domain channel 
vector hfc m , under the assumption of independent coefficients. We wish to study the high-SNR 
behavior of the rate gap upperbound in Theorem |3l in order to determine conditions under which 
the full multiplexing gain can be attained. We have the following result: 
Corollary 5.1: In high SNR regime, the rate gap (1201) can be relaxed to: 

af""— = l„g(l + ^^i2-<W^ (2!) 

Proof: See Appendix [IV] ■ 

In order to relate the number of feedback bits to the number of feedback channel uses, we 

make again the assumption that the feedback link can operate error-free at capacity. With a total 

of apoM = ^ — ( fl°y /jV ■, feedback channel uses per UT per frame, we let R(D) = B tot /M in 

(ED and obtain 

jKL.RWF.Ltaut ^ . (. 2 I-l/r !-W^ 



AiV • < log|l + ^— (^-J ] (22) 

It follows that the rate gap is bounded if a^/L > 1 and it vanishes when the inequality is strict. 



B. Scalar Uniform Quantization 

Achieving the rate-distortion limit requires, in general, grouping many source symbols into 
large blocks and performing optimal vector quantization. On the other hand, the CSIT feedback 
must have very low delay, and the L channel path coefficients must be quantized and sent 
back on each slot of T channel uses in order to enable the BS to compute the downlink 
beamforming matrix. Hence, optimal source coding and low feedback delay are two contrasting 
issues. Fortunately, it is well-known that simple scalar quantization achieves essentially the same 
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distortion versus SNR behavior, within a constant factor. Here we exploit this fact and consider 
a simple practical implementation of the above time-domain quantization scheme, where each 
UT performs uniform scalar quantization on real and imaginary part of its channel coefficients. 
Real and imaginary parts of each channel coefficient hk, m [l] are quantized independently with 
[Bi/2\ bits, where Bi is obtained, for example, by RWF or by some bit-allocation scheme aimed 
at minimizing the total distortion. The uniform scalar quantizer Q/ has Qi = 2^/^ quantization 
intervals of size A; > where Qi is an even integer, with thresholds 0, ±A/, ±2 A/, . . . , ±(Qi — 
2)A/2 and midpoint reconstruction levels ±A*/2, ±3A;/2, . . . , ±{Q t - l)A z /2. The Z-th path 
quantizer is obtained by choosing A; in order to minimize the quadratic distortion 

Ql/2-2 r{i+l)Al , A v 2 /-oo / A \2 

D(Q,,A,) = 2 J2 / (v-^i-^) f(v)dv + 2 (v-iQi-l)^) fWv 

-4 

where f{rj) = i—- e a i . The corresponding rate gap is upperbounded by 
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where Df u ® = minA,>o £> (Qi, A/). While for any finite Bi the optimization of A; must be 
carried out numerically and amounts to a simple line search, we can follow the analysis in [19] 
in order to capture the high-SNR behavior in closed form. If the total bit budget for quantization 
is large, we can assume that Bi 3> 1 for all I = 0, . . . , L — 1. Then, our goal is to set A; such 
that D (Qi,A[) = 2~ Bl , in order to have the same asymptotic behavior of the rate-distortion 
limit analyzed before. For a real Gaussian source with variance of /2 the following asymptotic 
upperbound holds [19] 

D(Q,,A,) < ^ + (Q iA t ) 2 P ovcr + (A?) (24) 

where the first term accounts for the so-called "granular distortion" and the second term accounts 
for the overload distortion, where the overload probability is given by 

Po™ = / A f{r])dr] < exp — 



By choosing A; = J lo J^ 2 Bi / 2 we obtain the desired mean-square distortion behavior that 
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decreases as Df VQ = ^-kBi2~ Bi + o(Bi2~ Bl ) where k « 6 is a constant independent of B t . In 
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particular, for uniform bit allocation Bi = B tot /(LM) and letting B tot = afbMlog 2 (l + ^/Nq) 
we obtain the upperbound 

k 6 V M N LMJ 

* lo 4 1+K ^^^UJ log2 ( 1+ ^)J (25) 

Hence, simple scalar uniform quantization yields a vanishing rate gap as long as a^/L > 1, 
which coincides with the condition for the rate-distortion limit of Corollary 15.11 On the other 
hand, this bound is not tight enough to capture the behavior for a^ = 1 (indeed, for a% = 1 
the bound yields a loglog(T/iVo) increase of the rate gap). 

In our numerical results we considered the optimization of the bit-allocation B\ subject to the 
constraint J2i=o ^>i = B tot . This is a classical integer programming problem, for which greedy 
solutions have been considered (e.g., see [20]). We omit the details of the allocation algorithm 
for the sake of space limitation here. However, it is apparent from the results of Section IVIll that 
RWF allocation comes very close to the more computational intensive greedy bit-allocation, and 
therefore it can be safely used in practice. 

VI. Exploiting the Physical Channel Structure 

While most analysis of OFDM systems assumes that the discrete-time channel impulse re- 
sponse h is formed by L independent Gaussian coefficients, this does not generally hold ex- 
actly. The commonly accepted Wide-Sense Stationary Uncorrelated Scattering (WSSUS) fading 
channel model [21] postulates that multipath components at different delays are uncorrelated. 
However, the delays of the physical channel are not, in general, integer multiples of the OFDM 
sampling frequency. In other words, while the continuous-time physical channel may obey the 
WSSUS model, the corresponding discrete-time channel has correlated coefficients. 

In this section we remove this unrealistic assumption and take advantage of the physical 

channel model. The continuous-time baseband channel impulse response can be written as 

p-i 

c ( t ^) = J2 c p^ 6 ^ T - T p^ (26) 

where c p (£) is a stationary Gaussian proper process with first-order distribution CN(0, fjl) and 
r p (t) is the p-th path delay [21]. Under the slowly time- varying assumption, r p (t) is assumed to 
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be independent of t for time intervals several order of magnitudes larger than the OFDM symbol 
duration, while c p (t) is assumed to be locally time-invariant over the channel coherence time, 
larger than the OFDM symbol duration. 

Let ip(t) denote the convolution of the transmit and receiving front-end filters (included in the 
D/A and A/D conversion). Then, the concatenation of filters and physical propagation channel 
around a reference time t is given by the convolution h(t;r) = i>(r) (8> c(t;r). By uniform 
sampling at rate l/W, focusing on an arbitrary reference time t — and neglecting the time- 
dependence because of the locally time-invariance assumption, we arrive at the discrete-time 

channel impulse response 

p-i 

h[l\=^2cp^([l-T p W]/W) (27) 

In matrix form, this can be written as h = \I/c where ^ G C LxP , c = (co, ..., cp_i) T and 
h = (h[0], ..., h[L — 1]) T as defined before. It is clear that in this case the covariance of h is not 
diagonal any longer, and it is given by S/j = ^T, C ^ H where S c = diag(/iQ, . . . , fip_x). 

Next, we state our main results on the achievable rate gap for analog feedback and "time- 
domain" quantized feedback by considering this more realistic channel statistics. We omit the 
proofs since they follow almost trivially into the footsteps of the previous results. It is however 
interesting to notice that the main effect of this more refined channel model is to replace L (the 
length of the discrete-time channel impulse response) by P (the number of physical multipath 
components). In practice, depending on the shape of 4>(t), we may have P significantly less 
than L. Hence, exploiting the knowledge of the physical channel (in terms of coefficients {c p } 
and delays {t p }) yields a clear advantage. In general, we assume that the multipath delays {r p } 
are known to the BS since they vary at a much slower rate and can be reliably tracked by the 
UTs and fed back at much lower duty cycle. Furthermore, the delays satisfy reciprocity even in 
FDD, and can be estimated by the BS using the uplink pilot symbols. 

For the case of analog feedback, we have: 

Theorem 4: The achievable rate gap of MIMO-OFDM ZFBF with analog channel state feed- 
back as described in Section [HI] is upperbounded by 

-P-z-l P-l r2 



-AF 



AR k = log 1 



M-\ 7 



k 5 M Nn 



p=0 p =p- z ^^ N A (p~P+z)_ 



2^ 6 l] + Z^ y ' 



(28) 
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where z = min{J, P}, {Xu\ : i = 0, . . . , z} are the non-zero eigenvalues of a\E , E c \E ,H a! H 
arranged in increasing order, and where {5?, : p = 0, . . . , P — 1} are the eigenvalues of \I/X c \E rH 
arranged in decreasing order. 

In particular, the rate gap is bounded as 7 '/N —* oo if J > P. In this case, it is upper 
bounded by the constant 



AR? = logl + ^— ^V^L| (29) 

The directional RVQ quantization scheme, that operates in the frequency domain under the 
assumption of piecewise constant channel, cannot take advantage from the physical channel 
knowledge. As for the "time-domain" approach, since h is a correlated vector we need to project 
it onto the appropriate Karhunen-Loeve basis in order to transform it into a set of independent 
"parallel" Gaussian sources. As a low complexity practical alternative, we consider also the 
direct quantization of the physical channel path coefficients c. 

We decompose £# as E# = U$U H where U is a unitary matrix and <fr is the diagonal 
matrix of eigenvalues. It follows that T, H has rank P < N, and we let 0q, . . . , 4 >2 P _ 1 denote its 
non-zero eigenvalues. Without loss of generality we can take U to be the tall Nx P matrix of the 
eigenvectors of E# corresponding to the non-zero eigenvalues. First, Hk, m is K-L transformed 
resulting in Cfc, m = U H Hfc /m . Then, RWF bit allocation is applied to the quantization of Ck t m- 
From the application of rate-distortion theory we have: 

Theorem 5: The achievable rate gap of MEVIO-OFDM ZFBF with K-L quantization described 
above, operating at the rate-distortion limit, is given by 

KL,RWF,Limit . ( _. M — 1 CP \ ^^ 

= log H D (30) 







Ai£ 


where D = E 


1 ~ ~" 12 
| c fc,l — c fc,l| 


, t 


and R(D) = } 


-jp=0 


logf_ 


+ 



, the number of quantization bits per UT given by B tot = MR(D), 
such that 7 is the solution of X^To 1 mm {7; 4> 2 p } — D. 
In high-SNR, using an approach similar to what was done in Section IV-AI and letting B tot = 
afbMlog 2 (l + y/No), we find the rate gap upper bound in the following simple and appealing 
form: 

AR k ^logll + oi^^^-J (31) 
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As already noticed, this shows that further performance improvement can be obtained by exploit- 
ing the structure of the physical channel. In particular, this is the case where L is considerably 
larger than P. 

Since the K-L transform requires an SVD of an N x N matrix, which may be computationally 
demanding for practical values of the OFDM symbol length N, we also consider quantizing 
directly the time domain coefficients, c k , m = [cfc, m [0], . . . , Ck, m [P — 1]] T - Letting c fci?Ti denote the 
quantized version of c fcm , we have 

-r-=-TQ,RWF,Limit _ , /. M— 17 r T ~ ,2"A 

AiV < log(^l + ^^-^-E[|*c M -*c M r]J 

- *(^w 

where D = J2p=o i'v^v w ^ tn D p = E [|c fejl [p] — c fcj i[p]| 2 ] and ijj p is the p-th diagonal element of 

The optimal time-domain quantization should consider a modified RWF bit-allocation that 
minimizes the weighted sum of distortions D = J2 P =o ^p^p- This can be straightforwardly 
done, and also a greedy bit-allocation can be applied to the case of scalar quantization. We 
omit the details for the sake of space limitation. It is interesting to notice that by applying 
the geometric-arithmetic mean inequality as in the proof of Corollary 15.11 and noticing that 
a 2 H = J2 P =o typttfr m e rate gap achieved by time-domain quantization is upperbounded by the 
same expression (1311) that holds for the K-L approach. This shows that the use of a K-L transform 
can only yield marginal improvements to the rate gap for high SNR. Therefore, we conclude 
that the time-domain quantization of the physical path coefficients provides a very attractive and 
low complexity solution for the CSIT feedback implementation. 

VII. Numerical results and conclusions 

We considered a MEVIO-OFDM system with M = 4 transmit antennas at the BS, K = 4 
single antenna UTs and N = 64 carriers. We assumed a discrete-time channel model with 5 
independent taps and DIP of {0.5, 0.24, 0.17, 0.06, 0.03}. Figs. [Hand |2] compare the lowerbounds 
and upperbounds on the sum rates for different feedback schemes as a function of aa, that quan- 
tifies the amount of total feedback channel uses per frame when SNR= lOdB. The lowerbound 



on the sum rate is calculated by R > i^(i? fc , csit — AR k ) where upperbound on the rate gap 
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is computed from (TTTT) for analog feedback, (fl~6l) for RVQ, (1201 for time-domain quantization 
and (1231) for scalar uniform quantization with both RWF and greedy bit-allocation (GBA). The 
upperbounds are computed by Monte Carlo simulation. The curve for RVQ corresponds to the 
optimal value of J obtained numerically for a given a ft . 

We notice that RVQ achieves the worst performance. We interpret this fact qualitatively by 
observing that with RVQ it is not clear how to exploit frequency correlation in an efficient 
way since the "interpolation" of the direction information over the subcarriers is not easily 
accomplished. On the other hand, if we augment direction information with (quantized) channel 
magnitude, we cannot outperform the rate-distortion inspired time-domain quantization, which 
treats directly the corresponding parallel Gaussian source in terms of mean-square distortion. In 
terms of order of decay for high SNR, scalar quantization of the time domain channel coefficients 
yields a very simple scheme that performs very close to perfect CSIT. Furthermore, time-domain 
scalar quantization is very simple to implement, and requires no complicated construction of 
spherical codebooks and vector quantization algorithms. Overall, also analog feedback with 
frequency-domain MMSE interpolation yields very competitive performance at low complexity, 
although its rate gap remains bounded and does not vanish as SNR increases. 

Next we considered the same system with SUT4 channel model given in [22] and omnidirec- 
tional antennas where the continuous-time channel model has 3 taps with path delays {0, 1.5, 4} 
fj,s and path variances {1,0.3162,0.1585}. ?p(t) is assumed to be a triangular pulse resulting 
from convolution of rectangular pulses corresponding to D/A and A/D (sample-hold) with width 
1/W = lfis. The lowerbounds and upperbounds on the sum rate can be computed similar to 
above. Figs. [3] and |4] compare the lowerbounds and upperbounds on the sum rates for different 
CSIT feedback schemes as a function of ««, when SNR= lOdB. We observe that time-domain 
quantization and K-L domain quantization perform very similar , in accordance with the rate-gap 
bound analysis done before. This shows that for any practical purpose there is no need of K-L 
transform. 

Finally, we considered the same SUI-4 channel model and compare two cases: 1) the trans- 
mit/receive pulse-shaping filter matrix ^ is known and 2) the matrix is unknown and the discrete- 
time channel coefficients are assumed to be independent while they are, indeed, correlated. Fig. 
|5] compares the upperbounds on the sum rates corresponding to different feedback schemes for 
these two cases as a function of a^ when SNR= lOdB. As it can be observed, knowledge 
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of masking matrix indeed improves the performance, even for such simple channel model and 
pulse- shaping filter. 



Appendix I 

From (OQ) we have that SE#S H = NaT,hCx. H where ex is the leftmost J x L block of the 
J x N matrix SF. Using this in (flOl) we can write 



N°^ 



tr (~Z h - pNY, h cx H (I + pNa'S h a. H ) 1 aS, 



tr S 



t , ^r^ 1 / 2 H \-il/2 



(33) 



where the last line follows from the matrix inversion lemma. Notice that Yi h is diagonal. We let 
{<Xm : £ = 0, . . . , L — 1} denote the sorted diagonal elements in decreasing order. Then, we let 
(A(j) : z = 0, . . . , z — 1}, with z = min{ J, L}, denote the non-zero eigenvalues of a.T, h cx H sorted 



in increasing order. The eigenvalues of the L x L matrix 
decreasing order, are given by 

1 



I 



piYSfaHaSf 



, sorted in 



1. 



1 + N P X {0) • 1 + iVpA^-i) 

Now, we use result H.l.g in [23, Ch. 9], stating that for any two n x n Hermitian positive 
semidefinite matrices A and B, we have tr (AB) < Y^l=i ^i(A)Aj(B) where A* (A) and A;(B) 
are eigenvalues of A and B sorted in the same order. It follows that 



1 



:trfS, 



L-z-l 



2 
V[l] 



L-l 



Er 



a 



[I] 



+ Np\ (l . 



(34) 



1=0 l=L-z ~ ' '■ r--v-L+z) 

For each UT k the channel estimation error on subcarrier n is given by e^fn] = Hfc[n] — Hfc[n] 
Since the noise and the fading process are spatially uncorrelated, we have that E [e fc [ra]e^[n]] = 



<7g [n]I, where a\ [n] is the n-th diagonal element of S e defined in (fTOT) . In particular, -^ X] n =o a 



n 



X^e 



cr: 



We use the rate-gap expression ©, and find 

E[|/ fc [n]| 2 ] 






n 



M 



M- 
M 



H*[n]vi[n] 



0V e 2 [n] 



eJJ[n]vj[n] 



M 



(35) 
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where the last line follows from the fact that H^[n]vj[n] = for any j ^ k from ZFBF, and 
that Vj[n] and ek[n] are independent, due to the fact that Vj[n] is a deterministic function of 
Hj[n] for i =£ j, and |vj[n]| = 1. Using this in © and using Jensen's inequality we obtain 

JV-l 

3g(l 



ARf < 



1 ^_ / M-l? 2 . 



N 



n=0 



M N 



< log 1 + 



M-l y 



-a; 



M JV n e 



(36) 



The desired expression (fTT|) follows from (|34|) . 



Appendix II 

We compute the variance of the interference term at frequency n, where we assume that n, n' 
are in the same cluster. Using known results on the average distortion of RVQ [11], we can 
write 

^ r n|2 



E[IW] 



= E E [I H " 



njVj-Ln 



M 



rtk 



c[n, n'} H k [n 1 ] + e k [n, n'] ) Vj [n' 



7_ 

M 



(6) 



£(|c[n,n']| 2 E |H fc [n']| 



/1|2 



E 



|H fc [n']| 2 



(c) „_^ / „ 0-B/(Af-l) 

< £ (|c[»,n']| a M<r|. 2 



+ rr|[n,n / ] ) -^ 



T 



M 



^(l-|c[n,n']| 2 )]- 



a^y |c[n,n / ]r2"^T + (l- |c[n,n']| 2 ) 



M-l 
M 



(37) 



where (a) follows from (fl4l) . (b) follows from the fact e fc [n, n'] is zero mean Gaussian independent 
of Hfc[n'] and Vj\n!\ and that norm and direction of H/Jn'] and iondependent, and (c) from ( 
Lemma 2 in [11]), the expression of the MMSE in terms of the correlation coefficient c[n,n'} 
and the fact that E [|H fc [n/]| ] = Ma 2 H since channels are spatially i.i.d.. 

The final result follows from © and from the fact that \c(n,n')\ 2 depends only on the 
difference 5 = n — n' and it is periodic of period N' . 
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Appendix III 

Let Hfc[n] denote the vector channel of UT k at frequency n, and Hfc[n] denote its reconstructed 
version obtained from the quantization of h^i, h^, • • • , iik,M- By replicating what was done for 
the analog feedback case, we have that 

E[\I k [n]\ 2 ] = ( -^^a![n] (38) 

where of[n] denotes the quantization error per antenna at frequency n. 
The rate gap for this case can be upperbounded by 

-KL,RWF,Limit 1 v^ , f __ M — 1 7 9r , 

-<JZ\n\ 



-r— p-KUKWl^Limit „ 1 V— \ , / „ 



TV ^ ° V M N, 



o 



n=0 

W . / I-lTl^ 1 
< login > (?;\n] 

\ M N ° N i^o 



I-1J 1 



logll + ^rr^^-^EllHM-H, 



M N N 

(6) . A M-1J 

= log H E 

s l M iVo 



|hfc,i — hfc,il 



w + ^jrfa* (39 > 



where (a) follows from Jensen's inequality and (b) from (fl8l) . 

Appendix IV 

In high SNR regime we have that a large number of quantization bits per symbol can be used, 
therefore 7 becomes small so that, eventually, 7 < min^ erf for all I = 0, . . . , L — 1. In this limit, 
all path coefficients are quantized with equal distortion 7. Therefore, D = L7 and from (|20l) we 
get 



-KL,RWF,Limit 



y m - 1 



Ai v < ^ ( 1 + Tv^r L7 ' (40) 



where 7 can be obtained from £19]) as 

/L-l \ Vi 

7 = 2-^ n^ (4i) 
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Next, we use the geometric-arithmetic mean inequality and write the loser, but more appealing, 
upper bound 

L-l \ V L , L-l 



Using this into (1401) . we arrive at (|2T|) . 



o / ;=o 
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Fig. 1. Comparison of lowerbounds and upperbounds on the sum rate for different feedback schemes with the discrete-time, 
uncorrelated path channel model when SNR is lOdB. 
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Fig. 2. Comparison of lowerbounds and upperbounds on the sum rate for different feedback schemes with the discrete-time, 
uncorrelated path channel model when SNR is lOdB. 
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Fig. 3. Comparison of lowerbounds and upperbounds on the sum rate for different feedback schemes with the continuous-time, 
uncorrelated path channel model when SNR is lOdB. 
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X 5 

To 

Q. 

£ 4 

03 
DC 



^ 3 



-e 



ZF Perfect CSIT 
KL,RWF,SUQ,UB 
9 - KL,RWF,SUQ,LB 
O KL,RWF,Limit,LB 
^TQ,RWF,SUQ,UB 
-TQ,RWF,SUQ,LB 
TQ,RWF,Limit,LB 



10 



20 



30 
a 



40 



50 



60 



fb 



Fig. 4. Comparison of lowerbounds and upperbounds on the sum rate for different feedback schemes with the continuous-time, 
uncorrelated path channel model when SNR is lOdB. 
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SNR= 10dB 
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Fig. 5. Comparison of upperbounds on the sum rate for different feedback schemes with the continuous-time, uncorrelated 
path channel model for known masking matrix vs. unknown matrix when SNR is lOdB. 



